All Questions
Tagged with algorithms data-mining
36 questions
3 votes
1 answer
34 views
Name of algorithm that maps a string column to a float column, based on an aggregation with another float column, similar to TF-IDF
The Question: I'm not super familiar with the names of common algorithms in data science, and I feel like this would be something that is commonly used, and so should have a name - want to refer to ...
1 vote
1 answer
202 views
How to predict what someone will order?
Suppose Prof. X goes to a roadside tea and coffee shop every day at 5 pm, right after leaving his office. Once there, he tosses a coin and orders either tea or coffee. The shop owner Y has been ...
5 votes
4 answers
8k views
What is the difference between AI, ML, NN and DL? [closed]
What is the difference between the following four categories: Artificial Intelligence (AI), Machine Learning (ML), Neural Network (NN), Deep Learning (DL), Data Science? My current understanding is that ...
1 vote
2 answers
707 views
How do I select the "best" unsupervised machine learning algorithm to cluster my specific dataset?
I want to cluster a dataset without prior knowledge of the correct number of clusters. For different algorithms (e.g. k-means, GMM, ...) I can iterate through different values and try to find the best ...
0 votes
1 answer
76 views
k-means clustering for recommendation
I have a file with 50000 rows from a library platform. Each row represents a user and shows the order in which that user selected books. The books could be from various categories (e.g. roman, ...
0 votes
1 answer
130 views
Which algorithms should I use for identifying similar characteristics between data points (the intersections)?
I am working with a dataset that has been coded and categorized, so that each data point has a set of coded characteristics. An example data point would be something like the following: Example Data ...
1 vote
1 answer
33 views
Need some advice on an approach to select only the informative emojis from the data set
I have a giant data set from a local election, which contains hashtags, emojis, and comments. I wanted to do a network analysis using only emojis. So far I have a network analysis graph made in R ...
0 votes
1 answer
72 views
What is the output polytree after applying the Ramex algorithm to this graph?
I've been trying to understand the way this algorithm works, but I can't get a consistent result. It has two phases: the first one converts a table of events into a graph, and the second where the ...
1 vote
0 answers
148 views
Is the Apriori algorithm suitable for database tuples?
Problem description: I need to use an association rule algorithm that lets me use database tuples and I think Apriori is a good option, but I am not sure. Starting point: I know the Apriori ...
2 votes
1 answer
164 views
Efficient way to search list of items in a text document
I have a list of items (size ~50K) and several documents (on average ~10 pages per document). I am trying to find which items are listed in each document, as follows: ...
2 votes
1 answer
69 views
Finding a Data pattern [closed]
I am new to this data science field. I have data of points in 3D space and each point "helps" a metric. I have the sets of points and corresponding metrics. Data might look like: ...
1 vote
1 answer
301 views
Predict ratings for Item Based Collaborative Filtering
Given the (cosine) similarity score of top 100 neighbors of every item, how do I predict ratings for unrated items? Please explain in simple terms. Item 1 260 0.577305 780 0.5655413 1210 0....
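One common way to turn neighbor similarities into a prediction is a similarity-weighted average over the neighbors the user has already rated. The sketch below assumes that approach, which may not be what the asker ended up using. `predict_rating` and the user's ratings are hypothetical; only the first two neighbor scores for item 1 are taken from the excerpt above.

```python
def predict_rating(user_ratings, neighbors):
    """Predict a rating as a similarity-weighted average.

    user_ratings: dict mapping item id -> rating already given by the user.
    neighbors: list of (item id, similarity) pairs for the target item.
    Returns None when the user has rated none of the neighbors.
    """
    weighted_sum = 0.0
    similarity_sum = 0.0
    for item_id, similarity in neighbors:
        if item_id in user_ratings:
            weighted_sum += similarity * user_ratings[item_id]
            similarity_sum += abs(similarity)
    if similarity_sum == 0.0:
        return None
    return weighted_sum / similarity_sum


# Neighbors of item 1 (scores from the excerpt); the ratings are made up.
neighbors_of_item_1 = [(260, 0.577305), (780, 0.5655413)]
hypothetical_user = {260: 5.0, 780: 3.0}
print(predict_rating(hypothetical_user, neighbors_of_item_1))
```

Dividing by the sum of absolute similarities keeps the prediction on the same scale as the input ratings, and neighbors the user has not rated are simply skipped.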
1 vote
0 answers
505 views
Pattern recognition in 2D dataset [closed]
I'm not familiar with the ways and tools of data scientists but I have a background in software engineering and mathematics. I'd need to recognize patterns in a two-dimensional data set. I have a lot ...
1 vote
2 answers
632 views
Shape of a distribution as a feature
How can I use the shape of a distribution as a feature in machine learning? Do I use something like the standard deviation?
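Standard deviation describes only spread. Skewness and excess kurtosis are common additional summaries of shape. The following is a minimal sketch, assuming each distribution is available as a plain list of samples. `shape_features` is a hypothetical helper, and the use of population (biased) moments is an assumption made for simplicity.

```python
from math import sqrt


def shape_features(values):
    """Summarize a sample by mean, spread, asymmetry, and tail weight."""
    xs = [float(v) for v in values]
    n = len(xs)
    if n == 0:
        raise ValueError("need at least one value")
    mean = sum(xs) / n
    std = sqrt(sum((x - mean) ** 2 for x in xs) / n)  # population std
    if std == 0.0:
        # A constant sample has no meaningful shape; report zeros.
        return {"mean": mean, "std": 0.0, "skewness": 0.0, "excess_kurtosis": 0.0}
    z = [(x - mean) / std for x in xs]
    return {
        "mean": mean,
        "std": std,
        "skewness": sum(v ** 3 for v in z) / n,               # third standardized moment
        "excess_kurtosis": sum(v ** 4 for v in z) / n - 3.0,  # 0 for a normal distribution
    }


print(shape_features([1, 1, 1, 10]))  # right-skewed sample, so skewness is positive
```

Each returned value can be used as one column of a feature vector. Quantiles or histogram bin counts are alternatives when a few moments are too coarse.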
1 vote
1 answer
39 views
Implementation of reliable rule learning
I want to perform "reliable rule learning", i.e. mining a set of rules with a very low number of false negatives. I recently read the paper "Reliable agnostic learning" by Kalai et al. (https://doi....